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Summary 

An association between low serum vitamin D levels and poorer melanoma survival has been reported. We have 
studied inheritance of a polymorphism of the GC gene, rs2282679, coding for the vitamin D-binding protein, which 
is associated with lower serum levels of vitamin D, in a meta-analysis of 3137 melanoma patients. The aim was to 
investigate evidence for a causal relationship between vitamin D and outcome (Mendelian randomization). The 
variant was not associated with reduced overall survival (OS) in the UK cohort, per-allele hazard ratio (HR) for 
death 1.23 (95% confidence interval (CI) 0.93, 1.64). In the smaller cohorts, HR in OS analysis was 1.07 (95% CI 0.88, 
1.3) and for all cohorts combined, HR for OS was 1.09 (95% CI 0.93, 1.29). There was evidence of increased 
melanoma-specific deaths in the seven cohorts for which these data were available. The lack of unequivocal 
findings despite the large sample size illustrates the difficulties of implementing Mendelian randomization. 



Significance 

Mendelian randomization utilizes inheritance of a genetic variant associated with the exposure of interest (here 
the gene coding for vitamin D-binding protein) to investigate causality: the concept being that a relationship 
between rs2282679 and survival would establish that low vitamin D levels impacted directly on outcome rather 
than the association being due to confounding e.g. by healthier people spending more time outdoors. We 
show no unequivocal evidence for causality even in a large multicentre study, but we cannot exclude a variable 
(J-shaped) effect in the presence of different population vitamin D levels. The value of Mendelian 
randomization is in practice limited by the small effect of genes and conceivably by non-linear relationships. 
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Introduction 

25-Hydroxyvitamin D (henceforth referred to as Vitamin 
D) levels shortly after diagnosis were shown to be 
inversely associated with Breslow thickness and posi- 
tively associated with survival within thickness groups in 
cases recruited to the Leeds Melanoma cohort (Newton- 
Bishop et al., 2009). A subsequent small study in stage IV 
melanoma patients reported that lower vitamin D levels 
were associated with poorer survival (Nurnberg et al., 
2009), but in the absence of further published data, the 
Leeds study remains unvalidated. The role of vitamin D in 
bone health is clear, and there is much evidence in the 
literature that it plays an important role in many other 
aspects of health, including cancer survival (Goodwin 
et al., 2009) (Autier and Gandini, 2007). A meta-analysis 
of all eligible randomized controlled trials (RCTs) for 
vitamin D given for a variety of reasons showed an overall 
survival advantage for supplementation (Autier and Gan- 
dini, 2007), but as vitamin D levels tend to be higher in 
leaner fitter people, vitamin D levels, in observational 
studies at least, may simply be a marker of better health, 
rather than causally related to survival. 

There are also data suggestive of increased mortality 
associated with high vitamin D levels in a large Danish 
study of patients tested in primary care (Durup et al., 
2012) and particular concerns about supplementation in 
melanoma patients because of a theoretical risk that high 
levels of vitamin D could be harmful due to its reported 
immunosuppressive effects. We have investigated the 
viability of Mendelian randomization (Katan, 1986) as a 
method of using inherited variation in genes of known 
function (in this case variation known to affect vitamin D- 
binding protein levels and therefore vitamin D levels in the 
blood) to examine the evidence for a causal effect of 
vitamin D level on melanoma survival, given the difficul- 
ties of establishing causation in the absence of a 



randomized clinical trial (RCT). Genome-wide association 
(GWA) studies have shown that the strongest genetic 
determinant of serum vitamin D levels is the single 
nucleotide polymorphism (SNP) rs2282679, located in 
intron 1 2 of GC, the gene coding for the vitamin D-binding 
protein (VDBP) (Ahn et al., 2010; Wang et al., 2010). We 
postulated that if inheritance of the minor allele of 
rs2282679, which is associated with lower levels of 
vitamin D (Davies et al., 2011), was also associated with 
poorer outcome from melanoma, then, using the principle 
of Mendelian randomization, the observation would sup- 
port the hypothesis that vitamin D has a causal relation- 
ship with melanoma survival. SNPs in other genes in the 
vitamin D metabolism pathway have been reported to be 
associated with serum vitamin D levels (Ahn et al., 201 0). 
In a Leeds study previously reported however, these 
SNPs explained each <1 % of the variance in serum levels 
and were therefore not tested. 

Results 

The association of the rs2282679 SNP with OS in the 
Leeds melanoma cohort reported in 2009 

The minor allele of the SNP was not significantly 
associated with overall survival in the cohort reported in 
2009 (n = 800, 119 deaths, Hazard ratio (HR) for death 
1.18, 95% confidence interval (CI) 0.89, 1.56, P = 0.2 
adjusted for age, sex, site of primary; and HR 1 .23, 95% 
CI 0.93, 1.64, P = 0.1 when also adjusted for Breslow 
thickness) or in the larger Leeds melanoma cohort of 
1390 cases, (HR for death 1.10, 95% CI 0.90, 1.35, 
P = 0.3 adjusted for age, sex and site; and HR 1.11, 95% 
CI 0.91, 1.36, P = 0.3 additionally adjusting for Breslow 
thickness). When follow-up was truncated at the end of 
2009 (in view of adoption of advice to avoid vitamin D 
depletion in Leeds after 2009) the HR was similar to that 
seen in the original cohort (n = 1358, HR 1.17, 95% CI 
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Figure 1. Forest plot showing the estimates per-allele HR for SNP rs2282679 in each of the ten cohorts with and without the Leeds cohort for (A) 
Overall survival (truncated at 8 yr), (B) Melanoma specific survival. Estimates were generated from Cox proportional hazard models and adjusted 
for age, sex, site of the primary tumour and Breslow thickness. 
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Table 1. Association of the rs2282679 SNP with overall survival (truncated at 8 yr) in ten melanoma cohorts. Cox proportional hazard models 
were fitted assuming an additive effect. In the majority of cohorts, inheritance of the minor allele was associated with poorer outcome 



Minor allele No. of HR (95% Cl) b per HR (95% Cl) c per 

Centre Cases frequency 3 deaths minor allele P-value minor allele P-value 



Leeds (2008) 


800 


0.29 


119 


1.18 (0.89, 1.56) 


0.2 


1.23 (0.93, 1.64) 


0.1 


Leeds (2012) 


1390 


0.29 


233 


1.10 (0.90, 1.35) 


0.3 


1.11 (0.91, 1.36) 


0.3 


Valencia 


398 


0.33 


78 


1.26 (0.90, 1.77) 


0.2 


1.30 (0.92, 1.83) 


0.1 


Essen 


304 


0.28 


97 


1.23 (0.89, 1.70) 


0.2 


1.19 (0.86, 1.65) 


0.3 


Stockholm 


203 


0.27 


53 


1.15 (0.75, 1.77) 


0.5 


1.14 (0.74, 1.78) 


0.5 


Vienna 


208 


0.27 


18 


1.40 (0.69, 2.83) 


0.4 


1.83 (0.84, 4.01) 


0.1 


Riga 


154 


0.31 


53 


1.42 (0.93, 2.16) 


0.1 


1.37 (0.89, 2.11) 


0.2 


Tampa 


484 


0.29 


97 


0.85 (0.62, 1.18) 


0.3 


0.95 (0.68, 1.32) 


0.7 


Lund 


162 


0.26 


45 


0.47 (0.26, 0.86) 


0.01 


0.49 (0.27, 0.89) 


0.02 


Vienna FFPE 


260 


0.28 


75 


0.88 (0.60, 1.30) 


0.5 


0.82 (0.55, 1.23) 


0.3 


Athens 


164 


0.25 


18 


1.15 (0.54, 2.45) 


0.7 


1.13 (0.51, 2.47) 


0.8 


BioGenoMEL without Leeds (2008) d 


2337 




534 


1.06 (0.87, 1.28) 


0.6 


1.07 (0.88, 1.30) 


0.5 


BioGenoMEL with Leeds (2008) d 


3137 




653 


1.08 (0.91, 1.27) 


0.4 


1.09 (0.93, 1.29) 


0.3 



frequency in CEU population of G allele in HapMap = 0.26. 

b Cases adjusted for age, sex, site of primary and a single primary melanoma recruited no more than 2 yr after diagnosis. 
°Cases additionally adjusted for Breslow thickness data > 0.75 mm. 
d Meta-analysis results assume a random effects model. 



0.94, 1.48, P = 0.2 adjusted for age, sex and site; HR 
1.20, 95% CI 0.95, 1.51, P = 0.1 when also adjusted for 
Breslow thickness, data not shown). 

Meta-analysis of the association of the rs2282679 
SNP with survival in the additional BioGenoMEL 
cohorts and the Leeds cohort 

Figure 1 and Table 1 show the association of rs2282679 
with overall survival (OS) in each of the nine additional 
BioGenoMEL cohorts. For six of the nine cohorts, the 
direction of effect was to see a nonsignificantly poorer 
outcome in association with the rarer allele. We 
saw evidence of heterogeneity between the cohorts 
(Cochran's Q, P = 0.1, I 2 = 41% for all smaller cohorts, 
Cochran's Q, P = 0.1, I 2 = 37% when combined with 
the larger Leeds cohort). Contrary to the other cohorts, 
in the Lund cohort, the variant was observed to have a 
protective effect (HR 0.49, 95% CI 0.27, 0.89, 
P = 0.02). When this cohort was dropped, there was 
no evidence of heterogeneity (Cochran's Q, P = 0.6, 
I 2 = 0% for the remaining eight BioGenoMEL cohorts 
and Leeds). 

In a meta-analysis of the nine additional BioGenoMEL 
cohorts combined, the minor allele was not significantly 
associated with poorer survival under a random effects 
model (HR 1 .06, 95% CI 0.87, 1 .28, P = 0.6, adjusted for 
age, sex and site; HR 1.07, 95% CI 0.88, 1.30, P = 0.5, 
when also adjusted for Breslow thickness). 

When the additional BioGenoMEL cohorts were also 
combined with the Leeds cohort in a random effects 
model, the estimated effect did not support a significant 
association between the rare allele and poorer outcome 
(HR 1.07, 95% CI 0.88, 1.30, P = 0.5 adjusted for age, 
sex, site; HR 1.09, 95% CI 0.93, 1.29, P = 0.3 when 
additionally adjusted for Breslow thickness). There was 



no meaningful difference in the results if the larger Leeds 
cohort was used instead. 

Melanoma-specific Survival (MSS) 

We compared MSS and OS where possible (Figure 1, 
Supporting information, Table S2). For most cohorts, the 
results were similar, but for the Tampa and Essen cohorts 
the association with the minor allele and MSS was stronger 
than for OS. Overall, in the cohorts for which MSS data 
were available, there was a statistically significant 
increased risk for melanoma deaths associated with the 
SNP associated with lower vitamin D levels, HR 1 .22 95% 
C1 1 .04-1 .43, P = 0.01 , assuming a random effects model. 

Haplotype analysis 

The rs2282679 SNP was in almost perfect linkage 
disequilibrium (LD) with rs4588 in the Leeds data 
(D' = 1, R 2 = 0.99), as has been reported, and was in 
strong LD with rs7041 (D' = 1, R 2 = 0.5). We subse- 
quently typed both SNPs in the rest of the BioGenoMEL 
cohort and fitted a joint Cox regression of haplotypes 
Gc1s and Gc2 for overall survival taking the ancestral 
Gc1f haplotype as baseline (Table 2, Figure 2). 

Neither analysis showed a statistically significant asso- 
ciation with outcome, but the observed effect sizes were 
larger than for the single SNP analysis and the direction of 
effect was consistent with our hypothesis regarding the 
effect of vitamin D. We saw evidence that possessing 
more copies of the Gc1s variant is associated with a poor 
outcome (HR 1.17 per haplotype adjusted for age, sex, 
site, Breslow, Gc2, 95% CI 0.95, 1.43, P = 0.1). Pos- 
sessing more copies of the Gc2 haplotype was associ- 
ated with an even poorer outcome (HR 1 .28 per haplotype 
adjusted for age, sex, site, Breslow, Gc1s, 95% CI 0.88, 
1.86, P = 0.2). 
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Table 2. Association of the per haplotype effect of Gc1s and Gc2 with overall survival (truncated at 8 yr) in nine melanoma cohorts. Cox 
proportional hazard models were fitted assuming an additive effect. In the majority of cohorts, inheritance of the minor allele was associated with 
poorer outcome. Both Gc1 s and Gc2 were jointly fitted in a multivariable model to look at the effect of each haplotype adjusted for the other two. 



Gc1s (baseline Gc1f) Gc2 (baseline Gc1f) 





Number of 


HR (95% Cl) a per 




HR (95% Cl) a per 




Centre 


cases (deaths) 


minor allele 


P-value 


minor allele 


P-value 


Leeds (2008) 


783 (120) 


1 .10 (0.76, 1 .59) 


0.6 


1 .31 (0.87, 1 .96) 


0.2 


Valencia 


373 (76) 


1.13 (0.67, 1.90) 


0.7 


1.41 (0.81, 2.47) 


0.2 


Essen 


244 (76) 


1.38 (0.83, 2.31) 


0.2 


1.45 (0.81, 2.61) 


0.2 


Vienna 


189 (18) 


3.58 (0.92, 13.91) 


0.07 


4.11 (1.01, 16.79) 


0.05 


Riga 


151 (51) 


1.72 (0.71, 4.19) 


0.2 


2.32 (0.89, 6.04) 


0.09 


Lund 


1 57 (43) 


0.59 (0.30, 1.15) 


0.1 


0.29 (0.12, 0.70) 


0.006 


Vienna FFPE 


241 (68) 


1.15 (0.69, 1.94) 


0.6 


0.85 (0.47, 1.53) 


0.6 


Barcelona 


259 (39) 


1.42 (0.59, 3.40) 


0.4 


2.12 (0.89, 5.06) 


0.09 


Athens 


168 (15) 


1.32 (0.39, 4.48) 


0.7 


1.34 (0.31, 5.83) 


0.7 


Overall 11 


2565 (506) 


1.17 (0.95, 1.43) 


0.1 


1.28 (0.88, 1.86) 


0.2 



a Cases adjusted for age, sex, site of primary, the other haplotype (Gc1s or Gc2) and Breslow thickness data. Cases have a single primary 
melanoma recruited no more than 2 yr after diagnosis. 
b Meta-analysis results assume a random effects model. 
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Figure 2. Forest plot showing the per-haplotype HR estimates for Gc1s and Gc2 in nine cohorts for overall survival (truncated at 8 yr). Estimates 
were generated from Cox proportional hazard models adjusted by age, sex, site, Breslow thickness and the other two haplotypes. 



As with the rs2282679 SNP, a strong effect is seen for 
the Lund cohort in the opposite direction. (HR = 0.59 per 
haplotype of Gc1s, 95% CI 0.30, 1.15, P = 0.1, adjusted 
for age, sex, site, Breslow, Gc2 and HR = 0.2 per 
haplotype of Gc2, 95% CI 0.12, 0.70, P = 0.006, adjusted 
for age, sex, site, Breslow, Gc1s). No significant heter- 
ogeneity was seen for the Gc1 s estimates (Cochrane's Q, 
P = 0.4, l 2 =1%). However, substantial heterogeneity 
was seen for the Gc2 estimates (Cochrane's Q, P = 0.02, 
I 2 = 56%), which was removed if the Lund cohort was 
omitted (Cochrane's Q = 0.4, I 2 = 1 %). 

Testing the assumption that the GC SNP is 
independent of survival, conditional on vitamin D 
levels 

A key assumption of Mendelian randomization is that the 
genetic variant only influences survival through its effect 
on the exposure variable (see Data S1, Figure S2). In the 
original Leeds cohort for cases with measured serum 
levels (n = 761), adjusting for vitamin D did reduce the 
observed effect size (HR 1.19, 95% CI 0.89, 1.58, P = 0.2 



adjusted for age, sex and site; HR 1.09, 95% CI 0.82, 
1.45, P = 0.6 when additionally adjusted for serum 
vitamin D as a continuous measure and season of blood 
draw, data not shown), suggesting that GC may have 
effects independent of its effect on vitamin D which 
would invalidate the Mendelian randomization approach. 

Analysis of cases with primaries in sun-protected 
body sites 

The reported association between rs2282679 and sur- 
vival was stronger for melanomas arising in sun-pro- 
tected sites than overall (Table S3) in the original Leeds 
cohort (HR 2.83, 95% CI 1.03, 7.73, P = 0.04). The 
Essen, Vienna FFPE and Valencia cohort data were 
similar (HR 3.48, 95% CI 0.65, 18.54, P = 0.1, HR 4.13, 
95% CI 1.07, 15.89, P = 0.04 and HR 1.63, 95% CI 0.76, 
3.47, P = 0.2 respectively). None of the other cohorts 
have enough rare cases to produce reliable estimates, 
though there was some evidence of a deleterious effect 
of the variant allele in the limited data from the Tampa 
cohort. 



© 2013 The Authors. Pigment Cell St Melanoma Research Published by John Wiley & Sons Ltd 



237 



Davies et al. 



Discussion 

We have previously reported an association between low 
serum levels of vitamin D soon after diagnosis and 
increased risk of death for melanoma patients (Newton- 
Bishop et al., 2009). The effects of vitamin D are 
pleiotropic, but vitamin D is clearly demonstrated to be 
antiproliferative for melanoma in vitro (Essa et al., 2010) 
and to promote cellular differentiation. We have looked at 
the association between inheritance of a variant in the GC 
gene encoding vitamin D-binding protein (VDBP) (associ- 
ated with lower levels of vitamin D) and survival, to test 
for a causal relationship between vitamin D and survival. 
Although this gene is the strongest known genomic 
predictor of serum vitamin D levels in GWA studies, it 
does explain only 4% of the variance in blood levels of 
vitamin D (Davies et al., 2011). 

The strength of the study is the size of the Leeds 
cohort and validation in multiple melanoma cohorts from 
Europe and North America, although some of those 
cohorts are individually small. A weakness of the study is 
that we could only use OS as the end point for outcome 
overall as melanoma specific survival data were not 
available for all cohorts, though in most cases the 
estimates under MSS and OS were similar. We did see 
a statistically significant association between the SNP and 
increased death from melanoma in the MSS analysis. 
However, we did not have MSS data from the Lund 
cohort, the cohort that contributed the majority of the 
heterogeneity observed in the OS analysis, and we are 
currently therefore unable to assess whether the Lund 
cohort would have a similar effect for MSS. A second 
weakness is that, in comparison to phenotypic factors 
such as Breslow thickness, genetic variants are likely to 
have only weak effects on outcome so that power is 
always an issue for studies of this type. 

The haplotype analysis (for which functional data are 
published) was supportive of a relationship between the 
gene and survival, as there was a trend towards 
increased survival in those individuals who express one 
or more copies of the Gc1f protein isoform though no 
significant associations were found. We looked at this 
isoform as it has been shown, in vivo, to be more 
abundant in the plasma, to have a higher binding affinity 
for vitamin D metabolites and to be associated with 
higher serum vitamin D levels. Thus, this is consistent 
with the previous finding of this group that higher vitamin 
D levels, measured at diagnosis, are associated with 
increased survival times, but the result is not conclusive. 
In our secondary analysis of the larger Leeds cohort 
followed up after advice was given to avoid vitamin D 
depletion was recommended in 2009, the reported 
association of the variant was weaker. 

There was some variation in the association between 
rs2282679 and survival between cohorts, which might be 
due to differences between populations, to differences in 
study design or to chance. The only significant outlier was 



the Lund cohort. When the Lund cohort is excluded, a 
significant association is seen between rs2282679 and 
outcome in the remaining cohorts (HR 1.15, 95% CI 1 .01 , 
1 .31 , P = 0.03). Similarly, if the Lund cohort is dropped, a 
significant association is seen between the number of 
copies of Gc2 present and outcome (HR = 1 .39, 95% CI 
1.1, 1.76, P = 0.006, adjusted as above) and the number 
of copies of Gc1s and outcome (HR = 1.25, 95% CI 1.01, 
1.54, P = 0.04, adjusted as above). Here, we consider 
possible explanations as to why the Lund cohort pro- 
duces results so different from the other cohorts. The 
most likely, given the small size of the Lund cohort (162 
cases), was that the inconsistent result is due to random 
variation. The Lund cohort was also recruited much earlier 
than other cohorts so that it may be that there is some 
unknown effect of study entry date on the association of 
the SNP with outcome that we have been unable to 
control for. However, it is also known that serum vitamin 
D levels in Scandinavia are the highest in Europe due to a 
diet rich in fish and mandatory fortification of foods 
(McKenna, 1992). In a population-based prospective 
cohort study of men living in Malmo (Southern Sweden 
near to Lund) born between 1923-1945, recruited in 
1991-1996, designed to look at diet and cancer risk 
(Brandstedt et al., 2012), reported levels of vitamin D in 
the serum were extremely high compared to other 
populations. The average vitamin D level in the Swedish 
prostate cancer cases was 87.4 nmol/l and 86.4 in 
controls. In the Leeds melanoma cohort, the average 
level around diagnosis was 57 nmol/l in population con- 
trols and 60 in sibling controls (Davies et al., 201 1). If high 
vitamin D levels were deleterious in melanoma as a result 
of immunosuppression, in a population with high dietary 
vitamin D, then a biological explanation for the difference 
between the Lund data and the others could conceivably 
be that in those circumstances inheritance of a genetic 
variant, further increasing levels might be associated with 
higher mortality. This biological hypothesis seems less 
likely than the possibility that the result in the Lund 
samples was a random event, but as vitamin D supple- 
mentation is common in the absence of measurement of 
levels, it is important to consider the possibility that high 
as well as low levels of vitamin D might be deleterious. 

We report evidence of an effect on survival for 
melanoma patients whose primary was in sun-protected 
sites, for example, acral lentiginous tumours. This com- 
parison was carried out because, although in case-control 
comparisons previously reported for serum vitamin D 
levels we showed no overall evidence that melanoma 
patients had lower levels than controls (Newton-Bishop 
et al., 2011), this sub-group in the Leeds cohort did have 
lower levels (paper in preparation). 

Even taking into account the size of our large dataset by 
melanoma cohort standards, insufficient power remains 
problematic. Using results from our previous analyses, 
we are able to quantify the magnitude of this problem; we 
have previously shown in the Leeds melanoma cohort 
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that there is a mean decrease of 5.7 nmol/l serum vitamin 
D levels per allele of rs2282679 (Davies et al., 201 1) and 
that the hazard ratio for death is 0.83 for an increase of 
20 nM (for OS) (Newton-Bishop et al., 2009). From these 
estimates, we would expect the per-allele effect of the 
SNP on survival to have a HR of 1.05, assuming the SNP 
had no effect on survival independent of its influence on 
serum vitamin D. Assuming that 20% of cases will die 
and a minor allele frequency (MAF) of 0.26, we would 
require over 42 000 cases to have 80% power to 
investigate an association of this magnitude and over 
20 000 cases to have 50% power. This demonstrates 
that while Mendelian Randomization is in theory an 
excellent tool for investigating causality, since genetic 
effect sizes are usually modest, very large cohort sizes 
would be required to reach definitive conclusions. Our 
findings corroborate those of a recent analysis, where a 
similar approach was used to determine a causal role for 
vitamin B12 in the risk of developing prostate cancer 
(Collin et al., 2011). The authors determined that to have 
90% power to detect a genotype-cancer association they 
would have needed 65 000 cases and 65 000 controls 
assuming the B12-prostate cancer association they had 
previously reported was not due to confounding. A 
Mendelian Randomization study has recently successfully 
been conducted where a variant at the FTO locus was 
shown to be associated with multiple cardiovascular 
disease related outcomes, containing data from 198 502 
individuals (Fall et al., 2013). For diseases such as 
melanoma, however, recruiting so many cases is not 
currently feasible. An alternative method for boosting 
power is to combine a set of genetic variants that 



captures more of the variation in the phenotype of 
interest. We have presented an analysis where we 
looked at haplotypes of GC, the strongest known deter- 
minant of vitamin D levels. Even though the environment 
largely determines vitamin D levels, greater understand- 
ing of variation in genes involved in vitamin D metabolism 
may eventually lead to the development of a set of 
variants that has enough power to overcome the limita- 
tions of Mendelian Randomization in this setting. 

Methods 

Data collection 

The Leeds melanoma cohort 

Population-ascertained incident melanoma cases were recruited to 
an ethically approved case-control study in a geographically defined 
area of the UK (Yorkshire and the Northern region south of the River 
Tyne, 67% participation rate); 960 cases (aged 18 to 76 yr) were 
recruited from September 2000 to December 2005 (Falchi et al., 
2009; Randerson-Moor et al., 2009). Additional cases were recruited 
from 2005 to 2012 to build the Leeds melanoma cohort. Since 
January 2007, cases have also been recruited from outside Yorkshire 
to enhance for participants having had a sentinel node biopsy and for 
melanomas arising in sun-protected sites. As of March 2012, there 
were 21 12 cases recruited, the majority of whom are still undergoing 
sample and data collection. 

In this study, we present analyses in 800 of the cases in which we 
reported an association of serum vitamin D previously. Cases chosen 
for analysis had tumours thicker than 0.75 mm, a single primary 
melanoma and complete follow-up data available at the time of 
analysis. We report a secondary analysis in the larger cohort of 1390 
with complete data in March 2012; interpretation of the results in the 
full dataset is complicated by the fact that vitamin D supplementation 
was recommended to many cases at the end of 2009, and it is 



Table 3. Cases eligible for analysis in the cohorts that comprise the meta-analysis of the association of the GC SNPs with outcome. Cases in each 
column also meet the criteria of all conditions to the left of it 



Incident Cases with complete data on 

case Number of adjusting covariates Number Cases not 

Whole (recruited Cases cases with with Cases with in sun 

cohort <2 yr after genotyped a single Breslow > complete protected 

Centre size diagnosis) for the SNP melanoma Age Sex Site Breslow 0.75 mm follow-up sites 



Leeds 3 (2008) 


1157 


1085 


1031 


1006 


1006 


1006 


1003 


991 


819 


800 


751 


Valencia 


1440 


1248 


654 


654 


654 


654 


640 


585 


399 


398 


342 


Essen 


941 


643 


630 


615 


574 


574 


490 


392 


304 


304 


281 


Stockholm 


870 


605 


222 


222 


222 


222 


222 


222 


203 


203 


203 


Vienna 


1085 


389 


251 


251 


242 


242 


242 


225 


212 


208 


195 


Riga b 


243 


242 


200 


199 


199 


199 


199 


171 


155 


154 


149 


Tampa 


585 


585 


505 


505 


505 


505 


503 


500 


484 


484 


463 


Lund 


355 


355 


338 


338 


338 


338 


327 


327 


162 


162 


160 


Vienna FFPE 


302 


302 


286 


279 


276 


276 


276 


273 


260 


260 


244 


Athens 


200 


200 


170 


170 


170 


170 


170 


170 


165 


164 


164 


Barcelona 


398 


358 


330° 


294 


280 


279 


277 


268 


259 


259 


250 


Validation set 


6419 


4927 


3586 


3527 


3460 


3459 


3346 


3133 


2603 


2596 


2451 


total 
























Leeds (2012) 


2112 


1996 


1699 


1661 


1661 


1661 


1661 


1619 


1430 


1390 


1289 



a This dataset was the initial Leeds cohort recruited up until 2006 and followed up until Nov 2008. 

b One case dropped because it had an improbably large recorded Breslow thickness that greatly affected the linear relationship of Breslow 
thickness with outcome in the multivariable model. 
c Cases genotyped for both rs7041 and rs4588. 
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unknown how many individuals followed this advice and to what 
degree. In previous studies, supplementation was an even more 
potent predictor of vitamin D levels than the inheritance of the 
rs2282679 SNP in the GC gene, which is strongly associated with 
serum levels (Davies et al., 2011). 

Recruitment (and therefore blood sampling) took place, wherever 
possible, within a period of 3-6 months after diagnosis. Season of 
serum vitamin D sampling was calculated from the date of blood 
draw and grouped as a categorical variable by month: Jan-Mar, Apr- 
Jun, Jul-Sept and Oct-Dec. Relapse/survival data are collected by 
annual participant questionnaire, contact (bi-annual survey) with 
family doctors, extraction of data from clinical notes and tagging 
with the cancer registry, which produces regular updates of death 
registration and new cancer diagnoses. OS and melanoma-specific 
survival (MSS) data are available. The Leeds cohort was analysed at 
two specific time points: when the association with vitamin D was 
first reported (15) and the most recent update in March 2012. The 
original dataset consisted of cases with tumours thicker than 
0.75 mm that were recruited before late 2006 and followed up until 
November 2008. In this analysis, but not in the original report, we 
also excluded from all analyses cases with multiple melanomas and 
those who were recruited 2 yr or more after diagnosis to produce 
consistency across all cohorts. 

BioGenoMEL cohorts 

Eleven cohorts, the Leeds cohort plus ten additional cohorts from 
Europe and the USA, contributed data to this analysis (Table 3). 
Additional details can be found in Data S1. 

Follow-up data are collected in the additional cohorts in a similar 
manner to the Leeds melanoma cohort with some local variations 



(Table S1). Only seven groups had MSS data and genotyping data for 
the rs2282679 SNP, so OS was used in the meta-analysis of these 
data sets. 

There was concern that cases with primary melanomas arising in 
sun-protected body sites may have a different biology to those in 
sun-exposed sites. Where possible, these cases were also analysed 
in a separate subanalysis. 

There was some variation between the cohorts, as has been 
reported before (Davies et al., 201 1). Figure 3 shows variation in the 
Breslow thickness distribution for each of the cohorts, showing that 
thicker tumours were removed in the Riga cohort and (to a lesser 
degree) in the Essen cohort, which recruited from a clinic serving as a 
referral centre within the Ruhr region of Germany. Figure 4 shows 
Kaplan-Meier estimates for survival in each of the ten cohorts. The 
Kaplan-Meier curves are relatively similar in most centres, with the 
exception of the Riga and Essen cohorts where the survival curve is 
much steeper, as was expected given the thicker tumours at 
presentation. Only cases with invasive tumours thicker than 
0.75 mm were included in the analysis. 

SNP genotyping 

The intronic GC SNP rs2282679 was genotyped in ten BioGenoMEL 
cohorts; residual DNA was insufficient to type this SNP in the 
Barcelona cohort. We subsequently typed two functional SNPs 
rs4588 and rs7041 in the Leeds cohort and nine of the BioGenoMEL 
cohorts (not including Tampa), the rs4588 SNP was poorly typed in 
the Stockholm cohort (call rate = 71%) so these data were dis- 
carded. These SNPs are both exonic and in strong linkage disequi- 
librium (LD) with the rs2282679 SNP and each other. The GC gene is 
highly polymorphic, leading to three common VDBP isoforms, 
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Figure 3. Box plots showing variation of 
Breslow thickness, age of diagnosis and 
date of study entry in each of the ten 
cohorts. Individual patient data was not 
available for the Tampa cohort. For the 
Lund cohort date of study entry was not 
available so date of diagnosis is presented; 
date of study entry was within 2 yr of 
diagnosis. 
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Figure 4. Kaplan-Meier estimator of 
overall survival (truncated at 8 yr) for each 
of the ten cohorts. Follow-up is truncated at 
8 yr post-diagnosis. 
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termed Gc1 f, Gc1 s and Gc2, as well as over 1 20 rarer variants (Cleve 
and Constans, 1988). Variation in rs4588 and rs7041 is responsible 
for generating the three common isoforms of the GC protein (see 
Supporting information for further details). Serum levels of vitamin D 
metabolites are highest for Gc1f and lowest for Gc2 (Lauridsen et al., 
2005). Similarly, binding affinity for vitamin D metabolites is also 
highest for Gc1f and lowest for Gc2 (Arnaud and Constans, 1993). 
The minor allele of rs2282679 is found exclusively in the Gc2 isoform 
(Figure S1). Different haplotypes are therefore associated with higher 
or lower levels of vitamin D, and we investigated the association of 
these two additional SNPs and the haplotypes in the Leeds cohort 
with survival. 

Statistical analysis 

Link-anonymized data were centralized in Leeds from the ten 
European BioGenoMEL cohorts. Analyses were carried out in a 
similar manner separately by the Tampa group to comply with their 
institutional review board approval. Initial analyses were carried out in 
the Leeds data alone, and validation was subsequently sought in the 
other data sets. 

Survival time was defined as the period between the date of 
surgical excision of the primary and date of death or last date of 
follow-up (at which point records were censored). Multivariate 
survival analyses were performed using Cox's proportional hazards 
model. Models were fitted using the 'coxph' routine in the 'survival' 
package in R 2.10.1. Hazard ratio estimates were calculated for the 
effect of the rs2282679 SNP (and subsequently two closely related 
SNPs, rs7041 and rs4588) on OS adjusted for sex, site of melanoma 
(head/neck, trunk, limbs or other) and age at diagnosis. Since vitamin 
D levels were reported to be associated with Breslow thickness, we 
also ran parallel analyses additionally adjusting for thickness. MSS 
data were not available from all groups; therefore, to reduce the 
number of non-melanoma-related deaths reported, we truncated OS 
time after 8 yrs of follow-up. A log-additive genetic model was 
assumed and the 'per-allele' hazard ratios were estimated. 

Additional checks were made to ensure that the formalin-fixed 
paraffin-embedded (FFPE) Vienna dataset was viable as an indepen- 
dent cohort. Normal DNA for cases in the FFPE Vienna cohort was 
extracted from archived material independent of patient survival. 
Kaplan-Meier estimates of survival were similar to those of the other 
participating cohorts (Figure 4). Violations of Hardy-Weinberg equi- 
librium were tested for the genotyped SNP, and no evidence of 



deviation was detected (P = 0.8). The cohort was therefore judged 
acceptable. 

Relevant point estimates and standard errors for each study were 
taken from the fitted Cox's proportional hazards models. These data 
were then used to carry out a random effects meta-analysis using the 
'forestplot' function in the 'rmeta' package in R. Models were 
reported adjusted only for age of diagnosis, sex, site and with/ 
without Breslow thickness. We also compared the association of the 
SNP for OS and MSS in the seven cohorts where cause of death was 
available. 

As reported previously (Davies et al., 2012), we did not adjust for 
tumour ulceration or American Joint Committee on Cancer (AJCC) 
stage because of incomplete data for ulceration across all the 
cohorts, pending central review of all pathology. More details of 
approaches taken to analysis of multiple cohorts can be found in 
Davies et al. (Davies et al., 2012). 

For the haplotype analysis of the two closely related variants in the 
GC gene in the Leeds cohort, LD was calculated using the 'pwld' 
routine in Stata version 10. Since rs7041 and rs4588 are in very 
strong LD (D' = 1) and are known to code specifically for three 
isoforms, we expect that the phase for double heterozygotes is 
known. To test this formally, we calculated haplotype frequencies in 
the Leeds Melanoma Cohort data using an expectation-maximization 
(EM) algorithm implemented in the 'hapipf routine in Stata. This 
confirmed that only the three haplotypes Gc1f, Gc1s and Gc2 were 
seen in these data. 

We fitted a Cox regression model for the per-haplotype associa- 
tion of Gc1s and Gc2 with overall survival adjusted for the other two 
haplotypes for each of the BioGenoMEL cohorts for which haplotype 
data were available. We constructed forest plots to test for a 
combined association in all cohorts as above. 

The proportional hazards assumption was tested using the 'phtest' 
function in Stata, looking for an association of the Schoenfeld residuals 
with time in the global model and for each covariate. Heterogeneity 
measures were calculated using the 'metaan' function in Stata. Power 
analysis was conducted using the 'stpower' function in Stata. 
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© 2013 The Authors. Pigment Cell & Melanoma Research Published by John Wiley & Sons Ltd 



243 



